Dataset statistics
| Number of variables | 8 |
|---|---|
| Number of observations | 753 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 47.2 KiB |
| Average record size in memory | 64.2 B |
Variable types
| Numeric | 6 |
|---|---|
| Categorical | 2 |
District has a high cardinality: 77 distinct values | High cardinality |
Local Level Name has a high cardinality: 737 distinct values | High cardinality |
Total family number is highly correlated with Total household number and 3 other fields | High correlation |
Total household number is highly correlated with Total family number and 3 other fields | High correlation |
Total population is highly correlated with Total family number and 3 other fields | High correlation |
Total Male is highly correlated with Total family number and 3 other fields | High correlation |
Total Female is highly correlated with Total family number and 3 other fields | High correlation |
Total family number is highly correlated with Total household number and 3 other fields | High correlation |
Total household number is highly correlated with Total family number and 3 other fields | High correlation |
Total population is highly correlated with Total family number and 3 other fields | High correlation |
Total Male is highly correlated with Total family number and 3 other fields | High correlation |
Total Female is highly correlated with Total family number and 3 other fields | High correlation |
Total family number is highly correlated with Total household number and 3 other fields | High correlation |
Total household number is highly correlated with Total family number and 3 other fields | High correlation |
Total population is highly correlated with Total family number and 3 other fields | High correlation |
Total Male is highly correlated with Total family number and 3 other fields | High correlation |
Total Female is highly correlated with Total family number and 3 other fields | High correlation |
_id is highly correlated with District | High correlation |
District is highly correlated with _id and 5 other fields | High correlation |
Total family number is highly correlated with District and 4 other fields | High correlation |
Total household number is highly correlated with District and 4 other fields | High correlation |
Total population is highly correlated with District and 4 other fields | High correlation |
Total Male is highly correlated with District and 4 other fields | High correlation |
Total Female is highly correlated with District and 4 other fields | High correlation |
_id is uniformly distributed | Uniform |
Local Level Name is uniformly distributed | Uniform |
_id has unique values | Unique |
Reproduction
| Analysis started | 2022-04-20 13:28:53.519164 |
|---|---|
| Analysis finished | 2022-04-20 13:29:00.498161 |
| Duration | 6.98 seconds |
| Software version | pandas-profiling v3.1.1 |
| Download configuration | config.json |
| Distinct | 753 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 377 |
| Minimum | 1 |
|---|---|
| Maximum | 753 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 38.6 |
| Q1 | 189 |
| median | 377 |
| Q3 | 565 |
| 95-th percentile | 715.4 |
| Maximum | 753 |
| Range | 752 |
| Interquartile range (IQR) | 376 |
Descriptive statistics
| Standard deviation | 217.516666 |
|---|---|
| Coefficient of variation (CV) | 0.5769672839 |
| Kurtosis | -1.2 |
| Mean | 377 |
| Median Absolute Deviation (MAD) | 188 |
| Skewness | 0 |
| Sum | 283881 |
| Variance | 47313.5 |
| Monotonicity | Strictly increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1 | 1 | 0.1% |
| 507 | 1 | 0.1% |
| 498 | 1 | 0.1% |
| 499 | 1 | 0.1% |
| 500 | 1 | 0.1% |
| 501 | 1 | 0.1% |
| 502 | 1 | 0.1% |
| 503 | 1 | 0.1% |
| 504 | 1 | 0.1% |
| 505 | 1 | 0.1% |
| Other values (743) | 743 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 753 | 1 | |
| 752 | 1 | |
| 751 | 1 | |
| 750 | 1 | |
| 749 | 1 | |
| 748 | 1 | |
| 747 | 1 | |
| 746 | 1 | |
| 745 | 1 | |
| 744 | 1 |
| Distinct | 77 |
|---|---|
| Distinct (%) | 10.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 KiB |
| Sarlahi | 20 |
|---|---|
| Dhanusa | 18 |
| Rautahat | 18 |
| Saptari | 18 |
| Morang | 17 |
| Other values (72) |
Length
| Max length | 16 |
|---|---|
| Median length | 7 |
| Mean length | 7.460823373 |
| Min length | 4 |
Characters and Unicode
| Total characters | 5618 |
|---|---|
| Distinct characters | 43 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Taplejung |
|---|---|
| 2nd row | Taplejung |
| 3rd row | Taplejung |
| 4th row | Taplejung |
| 5th row | Taplejung |
Common Values
| Value | Count | Frequency (%) |
| Sarlahi | 20 | 2.7% |
| Dhanusa | 18 | 2.4% |
| Rautahat | 18 | 2.4% |
| Saptari | 18 | 2.4% |
| Morang | 17 | 2.3% |
| Siraha | 17 | 2.3% |
| Rupandehi | 16 | 2.1% |
| Bara | 16 | 2.1% |
| Jhapa | 15 | 2.0% |
| Mahottari | 15 | 2.0% |
| Other values (67) | 583 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| sarlahi | 20 | 2.6% |
| saptari | 18 | 2.3% |
| dhanusa | 18 | 2.3% |
| rautahat | 18 | 2.3% |
| siraha | 17 | 2.2% |
| morang | 17 | 2.2% |
| rupandehi | 16 | 2.1% |
| bara | 16 | 2.1% |
| jhapa | 15 | 1.9% |
| mahottari | 15 | 1.9% |
| Other values (67) | 607 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1202 | |
| h | 448 | 8.0% |
| u | 362 | 6.4% |
| n | 315 | 5.6% |
| i | 291 | 5.2% |
| r | 290 | 5.2% |
| l | 266 | 4.7% |
| t | 254 | 4.5% |
| p | 190 | 3.4% |
| k | 168 | 3.0% |
| Other values (33) | 1832 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4817 | |
| Uppercase Letter | 777 | 13.8% |
| Space Separator | 24 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1202 | |
| h | 448 | 9.3% |
| u | 362 | 7.5% |
| n | 315 | 6.5% |
| i | 291 | 6.0% |
| r | 290 | 6.0% |
| l | 266 | 5.5% |
| t | 254 | 5.3% |
| p | 190 | 3.9% |
| k | 168 | 3.5% |
| Other values (12) | 1031 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 136 | |
| D | 101 | |
| B | 86 | |
| K | 80 | |
| R | 66 | |
| M | 61 | |
| P | 48 | 6.2% |
| J | 30 | 3.9% |
| N | 27 | 3.5% |
| T | 25 | 3.2% |
| Other values (10) | 117 |
Space Separator
| Value | Count | Frequency (%) |
| 24 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5594 | |
| Common | 24 | 0.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1202 | |
| h | 448 | 8.0% |
| u | 362 | 6.5% |
| n | 315 | 5.6% |
| i | 291 | 5.2% |
| r | 290 | 5.2% |
| l | 266 | 4.8% |
| t | 254 | 4.5% |
| p | 190 | 3.4% |
| k | 168 | 3.0% |
| Other values (32) | 1808 |
Common
| Value | Count | Frequency (%) |
| 24 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5618 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1202 | |
| h | 448 | 8.0% |
| u | 362 | 6.4% |
| n | 315 | 5.6% |
| i | 291 | 5.2% |
| r | 290 | 5.2% |
| l | 266 | 4.7% |
| t | 254 | 4.5% |
| p | 190 | 3.4% |
| k | 168 | 3.0% |
| Other values (33) | 1832 |
| Distinct | 737 |
|---|---|
| Distinct (%) | 97.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 KiB |
| Sunkoshi Rural Municipality | 3 |
|---|---|
| Tribeni Rural Municipality | 3 |
| Bishnupur Rural Municipality | 2 |
| Musikot Municipality | 2 |
| Mahalaxmi Municipality | 2 |
| Other values (732) |
Length
| Max length | 43 |
|---|---|
| Median length | 26 |
| Mean length | 26.02257636 |
| Min length | 16 |
Characters and Unicode
| Total characters | 19595 |
|---|---|
| Distinct characters | 50 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 723 ? |
|---|---|
| Unique (%) | 96.0% |
Sample
| 1st row | Aathrai Tribeni Rural Municipality |
|---|---|
| 2nd row | Maiwakhola Rural Municipality |
| 3rd row | Meringden Rural Municipality |
| 4th row | Mikwakhola Rural Municipality |
| 5th row | Phaktanglung Rural Municipality |
Common Values
| Value | Count | Frequency (%) |
| Sunkoshi Rural Municipality | 3 | 0.4% |
| Tribeni Rural Municipality | 3 | 0.4% |
| Bishnupur Rural Municipality | 2 | 0.3% |
| Musikot Municipality | 2 | 0.3% |
| Mahalaxmi Municipality | 2 | 0.3% |
| Likhu Rural Municipality | 2 | 0.3% |
| Miklajung Rural Municipality | 2 | 0.3% |
| Madi Municipality | 2 | 0.3% |
| Malika Rural Municipality | 2 | 0.3% |
| Madi Rural Municipality | 2 | 0.3% |
| Other values (727) | 731 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| municipality | 736 | |
| rural | 461 | |
| city | 17 | 0.8% |
| sub-metropolitian | 11 | 0.5% |
| tribeni | 7 | 0.3% |
| metropolitian | 6 | 0.3% |
| madi | 4 | 0.2% |
| rapti | 3 | 0.1% |
| bagmati | 3 | 0.1% |
| bheri | 3 | 0.1% |
| Other values (781) | 809 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 2837 | |
| a | 2654 | |
| u | 1556 | 7.9% |
| l | 1424 | 7.3% |
| 1307 | 6.7% | |
| n | 1177 | 6.0% |
| t | 984 | 5.0% |
| r | 908 | 4.6% |
| p | 879 | 4.5% |
| M | 837 | 4.3% |
| Other values (40) | 5032 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16199 | |
| Uppercase Letter | 2077 | 10.6% |
| Space Separator | 1307 | 6.7% |
| Dash Punctuation | 12 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2837 | |
| a | 2654 | |
| u | 1556 | |
| l | 1424 | |
| n | 1177 | |
| t | 984 | 6.1% |
| r | 908 | 5.6% |
| p | 879 | 5.4% |
| y | 829 | 5.1% |
| c | 796 | 4.9% |
| Other values (15) | 2155 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 837 | |
| R | 505 | |
| B | 126 | 6.1% |
| S | 108 | 5.2% |
| K | 80 | 3.9% |
| C | 60 | 2.9% |
| P | 56 | 2.7% |
| D | 53 | 2.6% |
| T | 51 | 2.5% |
| G | 48 | 2.3% |
| Other values (13) | 153 | 7.4% |
Space Separator
| Value | Count | Frequency (%) |
| 1307 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18276 | |
| Common | 1319 | 6.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 2837 | |
| a | 2654 | |
| u | 1556 | 8.5% |
| l | 1424 | 7.8% |
| n | 1177 | 6.4% |
| t | 984 | 5.4% |
| r | 908 | 5.0% |
| p | 879 | 4.8% |
| M | 837 | 4.6% |
| y | 829 | 4.5% |
| Other values (38) | 4191 |
Common
| Value | Count | Frequency (%) |
| 1307 | ||
| - | 12 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19595 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 2837 | |
| a | 2654 | |
| u | 1556 | 7.9% |
| l | 1424 | 7.3% |
| 1307 | 6.7% | |
| n | 1177 | 6.0% |
| t | 984 | 5.0% |
| r | 908 | 4.6% |
| p | 879 | 4.5% |
| M | 837 | 4.3% |
| Other values (40) | 5032 |
Total family number
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 736 |
|---|---|
| Distinct (%) | 97.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8976.723772 |
| Minimum | 125 |
|---|---|
| Maximum | 231714 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 KiB |
Quantile statistics
| Minimum | 125 |
|---|---|
| 5-th percentile | 2159.2 |
| Q1 | 4036 |
| median | 5896 |
| Q3 | 9510 |
| 95-th percentile | 23710.2 |
| Maximum | 231714 |
| Range | 231589 |
| Interquartile range (IQR) | 5474 |
Descriptive statistics
| Standard deviation | 12986.55224 |
|---|---|
| Coefficient of variation (CV) | 1.446691752 |
| Kurtosis | 134.0640627 |
| Mean | 8976.723772 |
| Median Absolute Deviation (MAD) | 2341 |
| Skewness | 9.434370137 |
| Sum | 6759473 |
| Variance | 168650539 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 5305 | 3 | 0.4% |
| 3627 | 2 | 0.3% |
| 8034 | 2 | 0.3% |
| 4031 | 2 | 0.3% |
| 5148 | 2 | 0.3% |
| 6812 | 2 | 0.3% |
| 3192 | 2 | 0.3% |
| 4822 | 2 | 0.3% |
| 5195 | 2 | 0.3% |
| 4143 | 2 | 0.3% |
| Other values (726) | 732 |
| Value | Count | Frequency (%) |
| 125 | 1 | |
| 320 | 1 | |
| 391 | 1 | |
| 456 | 1 | |
| 459 | 1 | |
| 471 | 1 | |
| 540 | 1 | |
| 543 | 1 | |
| 558 | 1 | |
| 600 | 1 |
| Value | Count | Frequency (%) |
| 231714 | 1 | |
| 143137 | 1 | |
| 98288 | 1 | |
| 77872 | 1 | |
| 57383 | 1 | |
| 51619 | 1 | |
| 51099 | 1 | |
| 50743 | 1 | |
| 47218 | 1 | |
| 47169 | 1 |
Total household number
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 720 |
|---|---|
| Distinct (%) | 95.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7493.590969 |
| Minimum | 125 |
|---|---|
| Maximum | 105649 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 KiB |
Quantile statistics
| Minimum | 125 |
|---|---|
| 5-th percentile | 1995.2 |
| Q1 | 3653 |
| median | 5195 |
| Q3 | 8151 |
| 95-th percentile | 19439 |
| Maximum | 105649 |
| Range | 105524 |
| Interquartile range (IQR) | 4498 |
Descriptive statistics
| Standard deviation | 8401.892705 |
|---|---|
| Coefficient of variation (CV) | 1.121210477 |
| Kurtosis | 54.11716468 |
| Mean | 7493.590969 |
| Median Absolute Deviation (MAD) | 2004 |
| Skewness | 5.946940988 |
| Sum | 5642674 |
| Variance | 70591801.03 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 2735 | 2 | 0.3% |
| 6306 | 2 | 0.3% |
| 2362 | 2 | 0.3% |
| 1364 | 2 | 0.3% |
| 2271 | 2 | 0.3% |
| 4795 | 2 | 0.3% |
| 3421 | 2 | 0.3% |
| 3652 | 2 | 0.3% |
| 4897 | 2 | 0.3% |
| 4231 | 2 | 0.3% |
| Other values (710) | 733 |
| Value | Count | Frequency (%) |
| 125 | 1 | |
| 306 | 1 | |
| 333 | 1 | |
| 441 | 1 | |
| 444 | 1 | |
| 454 | 1 | |
| 511 | 1 | |
| 539 | 1 | |
| 542 | 1 | |
| 548 | 1 |
| Value | Count | Frequency (%) |
| 105649 | 1 | |
| 101669 | 1 | |
| 77838 | 1 | |
| 49044 | 1 | |
| 45240 | 1 | |
| 45204 | 1 | |
| 41012 | 1 | |
| 40207 | 1 | |
| 39425 | 1 | |
| 39249 | 1 |
Total population
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 747 |
|---|---|
| Distinct (%) | 99.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38612.20452 |
| Minimum | 398 |
|---|---|
| Maximum | 845767 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 KiB |
Quantile statistics
| Minimum | 398 |
|---|---|
| 5-th percentile | 8931.2 |
| Q1 | 16768 |
| median | 25508 |
| Q3 | 44929 |
| 95-th percentile | 100099.4 |
| Maximum | 845767 |
| Range | 845369 |
| Interquartile range (IQR) | 28161 |
Descriptive statistics
| Standard deviation | 49904.37721 |
|---|---|
| Coefficient of variation (CV) | 1.292450867 |
| Kurtosis | 106.3215051 |
| Mean | 38612.20452 |
| Median Absolute Deviation (MAD) | 11398 |
| Skewness | 8.168110664 |
| Sum | 29074990 |
| Variance | 2490446865 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 15237 | 2 | 0.3% |
| 23891 | 2 | 0.3% |
| 18751 | 2 | 0.3% |
| 10205 | 2 | 0.3% |
| 42528 | 2 | 0.3% |
| 13900 | 2 | 0.3% |
| 19835 | 1 | 0.1% |
| 57341 | 1 | 0.1% |
| 23124 | 1 | 0.1% |
| 17077 | 1 | 0.1% |
| Other values (737) | 737 |
| Value | Count | Frequency (%) |
| 398 | 1 | |
| 1272 | 1 | |
| 1535 | 1 | |
| 1584 | 1 | |
| 1671 | 1 | |
| 1713 | 1 | |
| 1997 | 1 | |
| 2462 | 1 | |
| 2581 | 1 | |
| 2611 | 1 |
| Value | Count | Frequency (%) |
| 845767 | 1 | |
| 518452 | 1 | |
| 369377 | 1 | |
| 299843 | 1 | |
| 268273 | 1 | |
| 244750 | 1 | |
| 204788 | 1 | |
| 201079 | 1 | |
| 198098 | 1 | |
| 195951 | 1 |
| Distinct | 746 |
|---|---|
| Distinct (%) | 99.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18842.81408 |
| Minimum | 170 |
|---|---|
| Maximum | 431501 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 KiB |
Quantile statistics
| Minimum | 170 |
|---|---|
| 5-th percentile | 4448.6 |
| Q1 | 8115 |
| median | 12248 |
| Q3 | 22100 |
| 95-th percentile | 48053.4 |
| Maximum | 431501 |
| Range | 431331 |
| Interquartile range (IQR) | 13985 |
Descriptive statistics
| Standard deviation | 24891.92856 |
|---|---|
| Coefficient of variation (CV) | 1.321030312 |
| Kurtosis | 114.4046473 |
| Mean | 18842.81408 |
| Median Absolute Deviation (MAD) | 5539 |
| Skewness | 8.481348417 |
| Sum | 14188639 |
| Variance | 619608107.4 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 10214 | 2 | 0.3% |
| 12169 | 2 | 0.3% |
| 9626 | 2 | 0.3% |
| 6920 | 2 | 0.3% |
| 6158 | 2 | 0.3% |
| 10670 | 2 | 0.3% |
| 5713 | 2 | 0.3% |
| 34316 | 1 | 0.1% |
| 8628 | 1 | 0.1% |
| 35180 | 1 | 0.1% |
| Other values (736) | 736 |
| Value | Count | Frequency (%) |
| 170 | 1 | |
| 728 | 1 | |
| 748 | 1 | |
| 818 | 1 | |
| 831 | 1 | |
| 839 | 1 | |
| 1132 | 1 | |
| 1206 | 1 | |
| 1289 | 1 | |
| 1330 | 1 |
| Value | Count | Frequency (%) |
| 431501 | 1 | |
| 250999 | 1 | |
| 179744 | 1 | |
| 150702 | 1 | |
| 139849 | 1 | |
| 122769 | 1 | |
| 102334 | 1 | |
| 100417 | 1 | |
| 97324 | 1 | |
| 95705 | 1 |
| Distinct | 743 |
|---|---|
| Distinct (%) | 98.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19769.39044 |
| Minimum | 228 |
|---|---|
| Maximum | 414266 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 KiB |
Quantile statistics
| Minimum | 228 |
|---|---|
| 5-th percentile | 4450.2 |
| Q1 | 8553 |
| median | 13010 |
| Q3 | 23068 |
| 95-th percentile | 51533.2 |
| Maximum | 414266 |
| Range | 414038 |
| Interquartile range (IQR) | 14515 |
Descriptive statistics
| Standard deviation | 25046.36215 |
|---|---|
| Coefficient of variation (CV) | 1.266926374 |
| Kurtosis | 98.60091807 |
| Mean | 19769.39044 |
| Median Absolute Deviation (MAD) | 5726 |
| Skewness | 7.861075579 |
| Sum | 14886351 |
| Variance | 627320256.8 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 8410 | 3 | 0.4% |
| 27159 | 2 | 0.3% |
| 14711 | 2 | 0.3% |
| 8054 | 2 | 0.3% |
| 8816 | 2 | 0.3% |
| 8480 | 2 | 0.3% |
| 11467 | 2 | 0.3% |
| 10910 | 2 | 0.3% |
| 8446 | 2 | 0.3% |
| 28529 | 1 | 0.1% |
| Other values (733) | 733 |
| Value | Count | Frequency (%) |
| 228 | 1 | |
| 544 | 1 | |
| 766 | 1 | |
| 787 | 1 | |
| 840 | 1 | |
| 865 | 1 | |
| 874 | 1 | |
| 1256 | 1 | |
| 1275 | 1 | |
| 1292 | 1 |
| Value | Count | Frequency (%) |
| 414266 | 1 | |
| 267453 | 1 | |
| 189633 | 1 | |
| 149141 | 1 | |
| 128424 | 1 | |
| 121981 | 1 | |
| 106695 | 1 | |
| 103790 | 1 | |
| 102454 | 1 | |
| 99349 | 1 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| _id | District | Local Level Name | Total family number | Total household number | Total population | Total Male | Total Female | |
|---|---|---|---|---|---|---|---|---|
| 0 | 1 | Taplejung | Aathrai Tribeni Rural Municipality | 2869 | 2735 | 12288 | 6005 | 6283 |
| 1 | 2 | Taplejung | Maiwakhola Rural Municipality | 2275 | 2178 | 10365 | 5264 | 5101 |
| 2 | 3 | Taplejung | Meringden Rural Municipality | 2683 | 2528 | 12040 | 6181 | 5859 |
| 3 | 4 | Taplejung | Mikwakhola Rural Municipality | 1862 | 1792 | 7991 | 4000 | 3991 |
| 4 | 5 | Taplejung | Phaktanglung Rural Municipality | 2864 | 2700 | 11925 | 6239 | 5686 |
| 5 | 6 | Taplejung | Phungling Municipality | 7306 | 5888 | 28786 | 14160 | 14626 |
| 6 | 7 | Taplejung | Sidingba Rural Municipality | 2604 | 2484 | 10981 | 5593 | 5388 |
| 7 | 8 | Taplejung | Sirijangha Rural Municipality | 3329 | 3197 | 14186 | 7227 | 6959 |
| 8 | 9 | Taplejung | Pathivara Yangwarak Rural Municipality | 2738 | 2637 | 11797 | 5855 | 5942 |
| 9 | 10 | Panchthar | Falelung Rural Municipality | 4940 | 4773 | 20531 | 10211 | 10320 |
Last rows
| _id | District | Local Level Name | Total family number | Total household number | Total population | Total Male | Total Female | |
|---|---|---|---|---|---|---|---|---|
| 743 | 744 | Baitadi | Surnaya Rural Municipality | 3877 | 3206 | 18230 | 8549 | 9681 |
| 744 | 745 | Darchula | Apihimal Rural Municipality | 1363 | 1256 | 7023 | 3438 | 3585 |
| 745 | 746 | Darchula | Byas Rural Municipality | 2361 | 2168 | 10205 | 4884 | 5321 |
| 746 | 747 | Darchula | Dunhu Rural Municipality | 2293 | 1906 | 9912 | 4580 | 5332 |
| 747 | 748 | Darchula | Lekam Rural Municipality | 3045 | 2645 | 13638 | 6520 | 7118 |
| 748 | 749 | Darchula | Mahakali Municipality | 6080 | 4596 | 24572 | 12072 | 12500 |
| 749 | 750 | Darchula | Malikaarjun Rural Municipality | 3241 | 2993 | 15754 | 7631 | 8123 |
| 750 | 751 | Darchula | Marma Rural Municipality | 3086 | 2704 | 15586 | 7551 | 8035 |
| 751 | 752 | Darchula | Naugad Rural Municipality | 3156 | 2920 | 16434 | 7988 | 8446 |
| 752 | 753 | Darchula | Shailyashikhar Municipality | 4561 | 4012 | 21932 | 10685 | 11247 |